Choosing the Right Data Mining Technique: Classification of Methods and Intelligent Recommendation

Authors

  • David A. Swayne
  • Wanhong Yang
  • A. A. Voinov
  • Karina Gibert
  • Miquel Sànchez-Marrè
  • Víctor Codina
Abstract

One of the most difficult tasks in the whole KDD process is choosing the right data mining technique: commercial software tools offer more and more options, and the decision requires more and more methodological expertise. Indeed, many data mining techniques are available to an environmental scientist wishing to discover a model from her/his data. This diversity can cause trouble for scientists, who often do not have a clear idea of which methods are available and, moreover, tend to have doubts about which method is most suitable for solving a concrete domain problem. The data mining literature lacks a common terminology. A classification of data mining methods would greatly simplify understanding of the whole space of available methods. Furthermore, most data mining products either do not provide intelligent assistance for the data mining process or tend to do so in the form of rudimentary “wizard-like” interfaces that make hard assumptions about the user’s background knowledge. In this work, a classification of the most common data mining methods is presented in a conceptual map that makes the selection process easier. An intelligent data mining assistant is also presented. It is oriented toward model/algorithm selection support, suggesting to the user the most suitable data mining techniques for a given problem.

Similar resources

Predicting the Next State of Traffic by Data Mining Classification Techniques

Traffic prediction systems can play an essential role in intelligent transportation systems (ITS). Prediction and pattern comprehensibility of traffic characteristic parameters such as average speed, flow, and travel time could be beneficial both in advanced traveler information systems (ATIS) and in ITS traffic control systems. However, due to their complex nonlinear patterns, these systems ...

Intelligent Health Solution System

Introduction: In the field of management, the statistics and performance of the deputies and functions of the organization are always of great importance, which requires instant access to the latest status of the system under coverage and a minimal forecast of the future situation, in order to provide and improve quality services. All of this justifies the existence of an intelligent statistical system w...

Improved Product Ranking for Recommendation System

Data mining is the extraction of knowledge from large amounts of data. Many data mining methods are used for recommendation systems, among them classification, clustering, and association rule discovery. Real-life data needs to be pre-processed: data cleaning, filtering, and transformation are performed so that the data can be used by machine learning techniques in the analysis step. Co...

A Novel Intelligent Recommendation Algorithm based on Web Data Mining Technique under the Background of Deep Neural Network

The development of the Internet has brought us into an era of big data, which brings people convenience but also overwhelms them when they choose the information they need. Recommendation systems have arisen in response and have received wide attention and application. Therefore, to enhance the traditional method, we propose a novel intelligent recommendation algorithm based on We...

Use of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems

One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares the active user’s item ratings with the historical rating records of other users to find similar users, and recommends items that seem interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...

Journal title:

Volume   Issue 

Pages  -

Publication date: 2010